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Abstract —For general connections, the problem of finding net¬ 
work codes and optimizing resources for those codes is intrinsically 
difficult and little is known about its complexity. Most of the 
existing solutions rely on very restricted classes of network codes 
in terms of the number of flows allowed to be coded together, 
and are not entirely distributed. In this paper, we consider a 
new method for constructing linear network codes for general 
connections of continuous flows to minimize the total cost of 
edge use based on mixing. We first formulate the minimum- 
cost network coding design problem. To solve the optimization 
problem, we propose two equivalent alternative formulations with 
discrete mixing and continuous mixing, respectively, and develop 
distributed algorithms to solve them. Our approach allows fairly 
general coding across flows and guarantees no greater cost than 
any solution without inter-flow network coding. 

I. Introduction 

In the case of general connections (where each destination 
can request information from any subset of sources), the 
problem of finding network codes is intrinsically difficult. Little 
is known about its complexity and its decidability remains un¬ 
known. In certain special cases, such as multicast connections 
(where destinations share all of their demands), it is sufficient 
to satisfy a Eord-Eulkerson type of min-cut max-flow constraint 
between all sources to every destination individually. Eor mul¬ 
ticast connections, linear codes are sufficient [1], [2] and a 
distributed random construction exists [3]. In the literature, 
linear codes have been the most widely considered. However, 
in general, linear codes over finite fields may not be sufficient 
for general connections [4]. In addition, even when we consider 
simple scalar network codes (with scalar coding coefficients), 
the problem of code construction for general connections (i.e., 
neither multicast nor its variations) remains vexing [5]. The 
main difficulty lies in canceling the effect of flows that are 
coded together but not destined for a common destination. 

The problem of code construction becomes more involved 
when we seek to limit the use of network links for reasons 
of network resource management. In the case of multicast 
connections of continuous flows, it is known that finding a 
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minimum-cost solution for convex cost functions of flows over 
edges of the network is a convex optimization problem and 
can be solved distributively using convex decomposition [6]. In 
the case of general connections of continuous flows, however, 
network resource minimization, even when considering only 
restricted code constructions, appears to be difficult. 

In general, there are two types of coding approaches for 
optimizing network use for general connections. The first type 
of coding is mixing, which consists of coding together flows 
from sources using the random linear distributed code construc¬ 
tion of [3] (originally proposed for multicast connections), as 
though the flows were parts of a common multicast connection. 
In this case, no explicit code coefficients are provided and 
decodability is ensured with high probability by the random 
coding, given that mixing is properly designed. Eor example, 
in [7], a two-step mixing approach is proposed for network 
resource minimization of general connections, where flow par¬ 
tition and flow rate optimization are considered separately. In 
[8], we introduce linear network mixing coefficients and present 
a new method for constructing linear network codes for general 
connections of integer flows to minimize the total cost of edge 
use. The minimum-cost network coding design problem in [8] 
is a discrete optimization problem, which jointly considers 
mixing and flow optimization. The second type of coding 
is an explicit linear code construction, where one provides 
specific linear coefficients, to be applied to flows at different 
nodes, over some finite field. In this case, the explicit linear 
code constructions are usually simplified by restricting them 
to be binary, generally in the context of coding flows together 
only pairwise. Eor example, in [9] and [10], simple two-flow 
combinations are proposed for network resource minimization 
of general connections. 

The flow rate optimization in [7], the joint mixing and flow 
optimization in [8], and the joint two-flow coding and flow 
optimization in [9], [10] can be solved distributively. However, 
the separation of flow partition and flow rate optimization in [7] 
and the pairwise coding in [9], [10] lead in general to feasibility 
region reduction and network cost increase. In [8], we do not 
allow flow splitting and coding over time, leading to coded 
symbols flowing through each edge of the network at an integer 
rate. The restriction of integer flow rates affects the network 


cost reduction. 

In this paper, we consider a new method for constructing 
linear network codes to minimize the total cost of edge use 
for satisfying general connections of continuous flows. We 
generalize the linear network mixing coefficients introduced 
in [8]. In contrast to [8], we allow flow splitting and coding 
over time to further reduce network cost. Using mixing with 
generalized mixing coefficients, we formulate the minimum- 
cost network coding design problem, which is an instance 
of mixed discrete-continuous programming. Our mixing-based 
formulation allows for fairly general coding across flows, offers 
a tradeoff between performance and computational complexity 
via tuning a design parameter controlling the mixing effect, 
and guarantees no greater cost than any solution without inter¬ 
flow network coding. To solve the mixed discrete-continuous 
optimization problem, we propose two equivalent alternative 
formulations with discrete mixing and continuous mixing, 
respectively, and develop distributed algorithms to solve them. 
Specifically, the distributed algorithm for the discrete mixing 
formulation is obtained by relating the optimization problem to 
a constraint satisfaction problem (CSP) in discrete optimization 
and applying recent results in the domain [11]. The distributed 
algorithm for the continuous mixing formulation is based on 
penalty methods for nonlinear programming [12] in continuous 
optimization. 

II. Network Model and Definitions 

In this section, we first illustrate the network model for 
general connections of continuous flows. The model is similar 
to that we considered in [8] for integer flows, except that 
here we consider general flow rates and edge capacities, and 
allow flow splitting and coding over time. Next, to facilitate 
the understanding of the formulations proposed in Sections III, 
V, and IV, we also briefly illustrate the formal relationship 
between linear network coding and mixing established in [8]. 

A. Network Model 

We consider a directed acyclic network with general con¬ 
nections. Let Q = (V,£) denote the directed acyclic graph, 
where V denotes the set of U = |V| nodes and £ denotes the 
set of E = \£\ edges. To simplify notation, we assume there 
is only one edge from node i € V to node j € V, denoted 
as edge (l,j) € £} For each node i G V, define the set of 
incoming neighbors to be li = {j : {j,i) G £} and the set of 
outgoing neighbors to be Oi = {j : {i,j) G £}. Let li = \Ii\ 
and Oi = I Oil denote the in degree and out degree of node 
i GV, respectively. Assume h < D and Oi < D for all i G V. 
Let V = {1, • • • ,P} denote the set of P = \V\ flows to be 
carried by the network. For each flow p G V, let Sj, S V be 
its source. We consider continuous flows. Let Rp G K’*' denote 

'Multiple edges from node i to node j can be modeled by introducing 
multiple extra nodes, one on each edge. 


the source rate for source p, where IR+ denotes the set of non¬ 
negative real numbers. Let S = {si, • • • ,sp} denote the set 
of P = jiSj sources. To simplify notation, we assume different 
flows do not share a common source node and no source node 
has any incoming edges. Let T = {p, • • • , fp} denote the set 
of T = |7”| terminals. Each terminal t gT demands a subset 
of Pt = \Vt \ flows Vt L V. Assume each flow is requested by 
at least one terminal, i.e., = V. To simplify notation, 

we assume no terminal has any outgoing edges. 

Let Bij G R"*" denote the edge capacity for edge (i,j). Let 
Zij G [0,Bij] denote the transmission rate through edge (i, j). 
We assume a cost is incurred on an edge when information 
is transmitted through the edge. Let Uij{zij) denote the cost 
function incurred on edge {i,j) when the transmission rate 
through edge {i,j) is Zij. Assume Uij{zij) is convex, non¬ 
decreasing, and twice continuously differentiable in Zij. We 
are interested in the problem of finding linear network coding 
designs and minimizing the network cost J2{i j)£S 
general connections of continuous flows for those designs. 

B. Scalar Time-Invariant Linear Network Coding and Mixing 

For ease of exposition, in this section, we illustrate linear 
network coding and mixing by considering unit flow rate, unit 
edge capacity and one (coded) symbol transmission for each 
edge per unit time, and adopt scalar time-invariant notation. 
Later, in Sections III, V, and IV, we shall consider general 
flow rates and edge capacities and allow flow splitting and 
coding over time, which enable multiple (coded) symbols to 
flow through each edge at a continuous rate. 

Consider a finite field P with size F = |P|. In linear 
network coding, a linear combination over P of the symbols in 
{(Tki G F : k G Xi] from the incoming edges {{k, i) : k G F}, 
i.e., aij = ctkijO'ki, can be transmitted through the 

shared edge (i, j), where coefficient akij G P is referred to 
as the local coding coefficient corresponding to edge {k,i) C 
£ and edge {i,j) S £■ On the other hand, the symbol of 
edge (i,j) G £ can be expressed as a linear combination 
over P of the source symbols {ap G F p G V}, i.e., 
aij = Cij^ap, where coefficient Cij^p S P is referred 

to as the global coding coefficient of flow p G V and edge 

j) ^ Fet Cij — (czy,!; ‘: ^ij,pi * ‘; t^ij,p^ C F denote 
the P coefficients corresponding to this linear combination for 
edge {i,j) € £, referred to as the global coding vector of 
edge {i,j) S Note that, we consider scalar time-invariant 
linear network coding. In other words, akij € P and Cij^p G F 
are both scalars, and akij and Cij^p do not change over time. 
When using scalar linear network coding, for each terminal, 
extraneous flows are allowed to be mixed with the desired flows 
on the paths to the terminal, as the extraneous flows can be 
cancelled at intermediate nodes or at the terminal. 

In many cases, we shall see that the specific values of the 
local or global coding coefficients are not required in our 
design. For this purpose, we introduce the mixing concept 


based on local and global mixing coefficients established in [8]. 
Specifically, we consider the local mixing coefficient Pkij € 
{0,1} corresponding to edge (k,i) € S and edge (t,j) € S, 
which relates to the local coding coefficient akij G -F as 
follows. Pkij = 1 indicates that symbol aki of edge (fc, i) G F is 
allowed to contribute to the linear combination over T forming 
symbol ( 7 ^ and j3kij = 0 otherwise. Thus, if fikij = 0, we 
have akij = 0 (note that akij can be zero when Pkij = !)• 
Similarly, we consider the global mixing coefficient Xij^p £ 
{0,1} of flow p € V and edge {i,j) G £, which relates to 
the global coding coefficient Cij^p G T as follows. Xij^p = 1 
indicates that flow p is allowed to be mixed (coded) with 
other flows, i.e., symbol dp is allowed to contribute to the 
linear combination over T forming symbol dij, and Xij^p = 0 
otherwise. Thus, if Xij^p = 0, we have Cij^p = 0 (note that 
Cij^p can be zero when Xij^p = 1). Then, we introduce the 
global mixing vector x,y = - ,x,j^p,--- ,Xij^p) G 

{0,1}^ for edge (f, j) G £, which relates to the global coding 
vector Cy = ,Cij^pp-- ,Cij^p) G . Similarly, we 

consider scalar time-invariant linear network mixing. That is, 
Pkij G {0,1} and Xij^p G {0,1} are both scalars, and Pkij and 
Xij^p do not change over time. 

Linear Network Coding Linear Network Mixing 



Fig. 1: Comparisons between linear network coding and linear 
network mixing. Given the two global coding vectors on the 
left, we can tell that the (red) symbol can be decoded in the 
hrst case and cannot be decoded in the second case. However, 
given the two global mixing vectors on the right (same for the 
two cases on the left), it is not sufficient to tell whether the 
(red) symbol can be decoded or not. 

Global mixing vectors provide a natural way of speaking of 
flows as possibly coded or not coded without knowledge of the 
specihc values of the global coding vectors. Intuitively, global 
mixing vectors can be regarded as a limited representation of 
global coding vectors. Network mixing vectors may not be 
sufficient for telling whether a certain symbol can be decoded or 
not, as illustrated in Fig. 1. Therefore, using the network mixing 
representation, extraneous flows which are mixed with the 
desired flows on the paths to each terminal, are not guaranteed 
to be cancelled at the terminal. Let Op denote the vector with 
the p-th element being 1 and all the other elements being 0. 
Let V denote the “or” operator (logical disjunction). We now 
dehne the feasibility for scalar linear network mixing. 

Definition 1 (Feasibility of Scalar Linear Network Mixing): 
For a network Q = (V,£’) and a set of flows V with 
sources S and terminals T, a linear network mixing design 
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{xy G {0,1}^ : (*,j) G f} is called feasible if the following 
three conditions are satished: 1) Xg^j = Op for source edge 
(sp, j) G £, where Sp G 5 and p G V', 2) Xij = Vki^iiPkij^ki 
for edge (i,j) G £ not outgoing from a source, where i ^ S 
and Pkij G {0,1}; 3) = 1 for all p G Vt and 

Xit,p = 0 for all i G It and p 0 Vt, where t G T- 

Note that Condition 3) in Dehnition 1 ensures that for each 
terminal, the extraneous flows are not mixed with the desired 
flows on the paths to the terminal. In other words, using 
linear network mixing, only mixing is allowed at intermediate 
nodes. This is not as general as using linear network coding, 
which allows both mixing and canceling (i.e., removing one or 
multiple flows from a mixing of flows) at intermediate nodes. 

Given a feasible linear network mixing design (specihed by 
{Pkij G {0,1} : (k,i),{i,j) G £}), one way to accomplish 
mixing when T is large is to use random linear network coding 
[3] (to obtain [akij G T : G £}), as discussed 

in the introduction. Note that, in performing random linear 
network coding based on Pkij, akij can be randomly chosen 
in T when Pkij = 1, but akij must be chosen to be 0 when 

Pkij — 0 . 

III. Continuous Flows with Mixing Only 

In this section, we consider the minimum-cost scalar time- 
invariant linear network coding design problem for general 
connections of continuous flows with mixing only. Starting 
from this section, we consider multiple global mixing vectors 
(each may correspond to multiple global coding vectors) for 
each edge and allow coded symbols to flow through each edge 
at a continuous rate. 

A. Design Parameter 

We consider multiple global mixing vectors for each edge. 
This generalizes the linear network mixing coefficients intro¬ 
duced in [8]. We refer to the number of global network mixing 
vectors for each edge as the mixing parameter, denoted as 
L G {1, • • • , Tmax}, where Lmax is the maximum number of 
global network mixing vectors necessary for decodability using 
mixing (cf. Dehnition 1), and is given as follows. Let y denote 
the set of atoms of the algebra generated by {Vt '■ t G T}, 
i.e., y ^ {nterJ^t ■.yt=Vtoryt=V- Vt} - {0}. In 
other words, y gives a set partition of V that represents the 
Hows that can be mixed (cf. Dehnition 1) over an edge in the 
worst case (i.e., all terminals obtaining hows through the same 
edge). We choose Lmax = li^l- Note that 1 < Lmax < P, 
where L„iax = 1 for the multicast case, i.e., Vt = V for all 
t G T, and L„iax = P for the unicast case, i.e., Vp FVt = % 
for all t t' and t,t' G T. Fig. 2 illustrates an example of 
how partition y and mixing parameter L for the general case. 

For a given mixing parameter L, we now introduce the global 
and local network mixing vectors. For each I = I,-- - ,L, 
let 'K.ij(l) — (Xyp((),*** , Xij^pil), ' ‘ ' ,Xij^p{l)) G {0,1} 
denote the Z-th global network mixing vector over edge (i, j) G 


where 


4 



Fig. 2: Illustration of a feasible solution to Problem l.V = {1, 2, 3}, 
S = {si,S 2 ,S 3 }, Ri = R 2 = Ra = 1, Bij = 2 for all {i,j) G 
r = {fr,f 2 }, Pi = {1,2} and P 2 = {1,2,3}. Thus, y = 
{{1, 2}, {3}}, Ln,ax = |:F| = 2 and L G {1, 2}. 


f. Let G {0,1} denote the local mixing coefficient 

corresponding to the Lth global network mixing vector of edge 
{k,i) G £ (i.e., Xki{l)) and the m-th global network mixing 
vector of edge {i, j) € £ (i.e., (m)), where l,m = 1, ■ ■ ■ ,L. 


B. Problem Formulation 

We would like to find the minimum-cost scalar time-invariant 
linear network coding design with design parameter L G 
{I,- - , Lmax} for general connections of continuous flows 
with mixing only. 

Problem 1 (Continuous Flows with Mixing Only): 


K{l) 


min 




(j.j)Gf 


s.t. 0 < Zij < Bij, {i,j) G £ (1) 

Xij,p{l) e {0,1}, l = ,L, {i,j) G £■, p G P (2) 

Pkijil^rn) G {0,1}, ,L, G £ 

(3) 

fij,p{^) ^ Oj ^ = 1) • • ■ (*ii) € p G Vt, t gT 

(4) 


Ilj,p{l) < tGT, / = 1, • • • ,L, {i,j) G £ 

P&Vt 

(5) 

L 

'^Zij{l) < Zij, {i,j) G £ (6) 

1=1 

L L 

E E -EE fkpii) = 

keOi 1=1 keii 1=1 




iGV, p GVt, 

tGT 

(7) 

/E(0 < 


1-1 

II 

G £, 




pGVt, tGT 


(8) 


Gp , 1 — 1, 

••• ,L, {sp,j) & £, P 

GV 

(9) 


.. ,Lflkii{m,l)yiki{m), 




i = 

I,-- - ,L, {i,j) G£, 

i^S (10) 


0, 1 = 1,- 

• ■ , L, i Git, p ^ Vt, 

tGT 



( 11 ) 


{ Rp , t — Sp 

-Rp, i = t i GV, p GVt, t gT. (12) 
0, otherwise 

In the above formulation,^ fijpi^) ^ 0 can be interpreted 
as the rate of delivering flow p G Vt to terminal t G T 
over edge {i,j) G £ using 'x.ij{l), and Zij{l) denotes the 
transmission rate corresponding to Xy (Z) over edge {i,j) G £, 
where / = 1, • • • ,L. Problem 1 is a mixed discrete-continuous 
optimization problem and is NP-complete in general. For 
notational simplicity, in this paper, we omit the conditions 
{zij : {i,j) G £}, {zij{l) : I = I,-■ ■ ,L,{i,j) G £}, 
■■ I = I,- - ■ G £,p G Vt,t G T}, {xij,p{l) : 

I = I,-- - ,L,{i,j) G £,p G V}, and {Pkij{l,m) : l,m = 
1, • • • ,L, (k, i),{i,j)G£} where there is no confusion. 

Remark 1 (Problem 1 with L = 1 for Multicast): For the 
multicast case (i.e., Vt = P for all f G 70 and L = 1, the 
constraint in (11) does not exist, the constraint in (6) can be 
satisfied by choosing Zij{l) = Zij, and the constraint in (8) 
is always satisfied by choosing /3feij(l,l) = 1 and choosing 
Xij^p{l) according to (9) and (10). Therefore, in the multicast 
case. Problem 1 with L = 1 for general connections reduces 
to the conventional minimum-cost network coding design 
problem for the multicast case [6]. 

Remark 2 (Comparison with Intra-flow Coding): Problem 1 
(with any L G {I,-- - ,Lmax}) with an extra constraint 
e {0>1} for all (i,j) G f and / = 1, • • • , L 
is equivalent to a minimum-cost intra-flow coding problem. 
Thus, the minimum network cost of Problem 1 (with any 
L G {1, • • • , Lmax}) is no greater than the minimum costs 
for intra-flow coding. 

Remark 3 (Comparison with Two-step Mixing): Problem 1 
with L = Lmax and Pkij{l, m) = 1 instead of (3), is equivalent 
to the minimum-cost flow rate control problem in the second 
step of the two-step mixing approach in [7]. Thus, the minimum 
network cost of Problem 1 with L = Lmax is no greater than 
the minimum cost of the two-step mixing approach in [7]. 

Example 1 (Illustration of Linear Network Mixing): We il¬ 
lustrate a feasible mixing design (corresponding to a feasible 
solution) to Problem 1 with L = 2 for the example in Fig. 2. 
For ease of illustration, in this example, we consider unit source 
rate and do not consider flow splitting and coding over time. 
For source edges (1,6), (1,4), (2,7), (2,4) and (3,4), choose the 
global mixing vectors as follows: xiq{1) = Xi 4 (Z) = (1,0,0), 
X24(0 = X27(0 = (0:1)0) and X34(() = (0,0,1) for all 
I = 1,2. In addition, choose the local coding coefficients 
as follows: = /3245(1,1) = /3345(1,2) = 1, 

/3i 45(2,1) = /3245(2,1) = /3345(2,2) = 0, /3i45(m, 2) = 

^Note that (2) with j = t, (7) with i = t, and (8) with j = t imply 
'^i£Xt 1=1 ■■■ L^it,pW — 1 for P ^ Condition 3) of Definition 

1, where t gT. 
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P245{m, 2) = /3345(to, 1) = 0 for all m = 1, 2, /3456(1,1) = 1, 
/3456(2,1) = /3456(1,2) = ^456(2,2) = 0, /3457(1, 1) = 
/3457(2,2) = 1 and /3457(1,2) = ^ 57 ( 2 ,!) = 0. Therefore, 
for edges (4,5), (5,6) and (5,7) not outgoing from a source, 
the global mixing vectors are given by X 45 (l) = (1,1,0), 
X45(2) = (0,0,1), X56(l) = (1,1,0), X56(2) = (0,0,0), 
^ 57 ( 1 ) = (1,1,0) ^ 57 ( 2 ) = (0,0,1). On the other hand, 

flow paths (sets of edge-mixing index pairs for which 

the rates of delivering flows are one) from the three sources, 
i-e-, = 1, ihj) e f, Z = I,-- - ,i} for 

all p € Vt and t € T, aie illustrated using green, blue and 
pink curves in Fig. 2. Accordingly, choose the transmission 
rates as follows: Zij{l) = 1 and Zij{2) = 0 for all {i,j) = 
(1,6), (1,4), (2, 7), (2,4), (3,4), 245 ( 1 ) = ^45(2) = ^ 55 ( 1 ) = 
257 ( 1 ) — 257 ( 2 ) — 1 , 255 ( 2 ) — 0 , and Zij — 22 ^( 1 ) -t- Zij{2^ for 
all {i,j) e £. 

The following lemma shows the existence of a feasible linear 
network code corresponding to Problem 1. 

Lemma 1: Suppose Problem 1 is feasible. Then, for each 
feasible solution, there exists a feasible linear network code 
with a field size F > T to deliver the desired flows to each 
terminal. 

Proof: Please refer to Appendix A. ■ 

Note that a feasible linear network code can be obtained from 
a feasible linear network mixing design (a feasible solution 
to Problem 1) using random linear network coding [3], as 
illustrated in Section II-B. 

Example 2 (Illustration of Linear Network Coding): We il¬ 
lustrate how to obtain a feasible linear network code using 
random linear network coding, based on the feasible linear net¬ 
work mixing design illustrated in Example 1. In this example, 
one local mixing coefficient (global mixing vector) corresponds 
to one local coding coefficient (global coding vector).^ For 
the source edges, choose the global coding vectors as follows: 
c,,(0 =Xy(0 for all (i,j) = (1,6), (1,4), (2, 7), (2,4), (3,4) 
and I = 1,2. For all l,m = 1, • • ■ ,L and {k,i),{i,j) € £, 
if I3kij{l,m) = 0, choose = 0; if I3kij{l,m) = 1 , 

choose akij{l, m) uniformly at random from F. Therefore, for 
the edges not outgoing from a source, the global coding vectors 
are given by (/) = X^fcei m=i - l l)cki{m) for all 

(*,j) = (4,5),(5,6),(5,7)andZ = i,..-,T. 

C. Network Cost and Complexity Tradeoff 

The design parameter L in Problem 1 determines the com¬ 
plexity and network cost tradeoff. First, we illustrate the impact 
of L on the complexity of Problem 1. By (3), we know that 
for given {k,i),{i,j) £ £, the cardinality of {I3kij{l,m) : 
/,m = I,-- - ,L} is Since Ei^,J)eeOJ = < 


DOj = DE, the cardinality of {/?fey (Z,m) : I, m = 
1, ■ • • ,L, {k,i), {i,j) £ £"} is smaller than or equal to LfDE. 
Note that by (9) and (10), {xij^p{l)} can be fully determined by 
{Pkijif iTi)}. Therefore, the cardinality of the discrete variables 
{xij,p{l)} and {fkijifiTi)} of Problem 1 is LfDE, which 
increases as L increases. 

Next, we discuss the impact of L on the network cost. 

Lemma 2: If Problem 1 is feasible for design parameter L, 
then Problem 1 is feasible for design parameter L -t- 1 and 
U*{L -f 1) < U*{L), where L =!,■■■ , Lmax - 1- 


Proof: Given a feasible solution to Problem 1 with design 
parameter L, by setting variables w.r.t. index I = L + 1 or 
m = L+1 to be zero, we can easily construct a feasible solution 
to Problem 1 with design parameter L-l-1. This feasible solution 
corresponds to the same network cost as the one with design 
parameter L. But the network cost with design parameter L + 1 
can be further optimized by solving Problem 1 with design 
parameter L + 1. Therefore, we can show U*{L -b 1) < U*{L) 
for all L — l,*‘-,T/max 1- ® 

By Femma 2, we know that the network cost U*{L) is 
non-increasing w.r.t. L. This can also be understood from the 
example in Fig. 2. Note that by Condition 3) in Definition 1, 
flow 3 is not allowed to be mixed with flow 1 and flow 2 on their 
paths to terminal ti. When L = 1 < Tmax, flow 3 cannot be 
delivered over edge (4, 5) to terminal t 2 using feasible mixing. 
In other words. Problem 1 with L = 1 is not feasible (i.e., of 
infinite network cost). However, when L = 2 = Lmax, flow 
3 can be delivered to terminal t 2 without mixing with flow 1 
and flow 2 over edge (4, 5), e.g., using global mixing vectors 
X 45 (l) = (1,1,0) and X 45 ( 2 ) = (0,0,1) over edge (4,5). In 
other words. Problem 1 with L = 2 is feasible (i.e., of finite 
network cost). Thus, we can see the impact of L on the network 
cost shown in Femma 2. 


IV. Alternative Formulation with Discrete Mixing 

Problem 1 is a mixed discrete-continuous optimization prob¬ 
lem with two main challenges. One is the choice of the 
network mixing coefficients (discrete variables), and the other 
is the choice of the flow rates (continuous variables). In this 
section, we first propose an equivalent alternative formulation 
of Problem 1 which naturally subdivides Problem 1 according 
to these two aspects. Then, we propose a distributed algorithm 
to solve it. 


A. Alternative Formulation 

Problem 1 is equivalent to the following problem. 
Problem 2 (Equivalent Problem of Problem 1): 


^Note that when flow splitting or coding over time happens, one local mixing 
coefficient (global mixing vector) may con'espond to multiple local coding 
coefficients (global coding vectors). In this case, a linear network code can be 
designed in a similar' way based on the sub-flows and sub-edges established in 
the proof of Lemma 1. 


where U*{{xij^p{l)}) and AT(L) are given by the following 
two subproblems. 
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Subpmblem 1 (Subproblem of Problem 2: Flow Optimization).in a more compact form as follows: 

For given {xij^p{l)}, we have: , 

{xfej(m) : m = 1, • • • ,L,k€ IJ, 


Uxi{xij,pil)}) = min 

s.t. (1), (4), (5), ( 6 ), (7), ( 8 ) 


E 


{i,3)e£ 


{xfcj(m) : m =!,■■■ ,L,k€lj,j € T}^ 

1, if j ^T, (14) holds 
1, if j e r and p G Vj, (14) and (13) hold 
1, if j gT and p ^ Vj, (14) and (11) hold 


Subproblem 2 (Subproblem of Problem 2: Feasible Mixing): 
Find the set A4(X) ^ {{x,,,p(0} : (2), (3), (9), (10), (11), (13)} 
of feasible {xij^p{l)}, where (13) is given by: 

,L Xit^p{l) = f, pGVt, IgT. (13) 


Note that for given {xij^pfi)}. Subproblem 1 is a convex 
optimization problem (involving continuous flow rates) and 
hence has polynomial-time complexity. On the other hand, Sub¬ 
problem 2 is a discrete feasibility problem (involving discrete 
mixing coefficients) and is NP-complete in general. Therefore, 
Problem 2 is still a mixed discrete-continuous optimization 
problem and is NP-complete in general. 


B. Distributed Solution 


}0, otherwise 

l = f...^L,{i,3)G£,pGV,i^S. (15) 

Note that, when j ^ T, {xfej(r 7 i) : m = 1, • • • ,L,k G 1.j,j G 
T} = 0 and we ignore it in the clause (j\fp{-). For (13) and 
( 11 ) in clause (j\j p{-), we use j as the terminal index instead 
of t. It can be seen that the constraints in (10) (i.e., (14)), 
(11) and (13) are considered in clause 4>ifp{-). In addition, the 
constraint in (9) is considered when choosing Xg^jil) = Bp for 
all I = 1, • • • , L, {sp,j) G S and p G V. Therefore, the CSP 
has considered all the constraints in Subproblem 2. 

We now construct the clause partition of Subproblem 2. 
Specifically, the set of clauses variable Xipp(l) participates in 
is as follows: 

*5,P = Iff] u ^ = 1. ■ • ■ , i. t € Oj} 

U (K’j’,1 ; m = 1, - ■ - ,L, A- 6 Xj.i e t| 

I = f,-■ ■ ,L, (ij) G £, p GV, i ^ S. (16) 


In this part, we develop a distributed algorithm to solve 
Problem 2 by solving Subproblem 1 and Subproblem 2, respec¬ 
tively, in a distributed manner. First, we consider Subproblem 2. 
Subproblem 2 can be treated as a CSP and solved distributively 
using clause partition and the Communication-Free Learning 
(CFL) algorithm from [11]. While CSPs are NP-complete in 
general, CFL provides a probabilistic distributed iterative algo¬ 
rithm with almost sure convergence in finite time. Specifically, 
{xij,p{l)} U {Pkij{l, "m)} can be treated as the variables of the 
CSP. {0,1} can be treated as the finite set of the CSP. From 
( 10 ), we have an equivalent constraint purely on {xij^p{l)}, i.e., 

3 Pkijim, 1) G {0,1} Vfc e li, m = 1, • • • ,L, 
s.t. 'K.ij(l) = V/cGXi,m—I,---(nr, ^)x;c 2 (ttT.), 

1 = 1,■■■ ,L, (i, j)g£, i^S. (14) 

In the following, we shall only consider solving for the 
variables {xij^p(l)} of the CSP in a distributed way using 
clause partition and CFL, as {Pkijih n^)} can be obtained from 
feasible {xipp{l)} by (9) and (10). In addition, we directly 
choose Xs^j{l) = Bp for all Z = 1, • • • , L, {sp,j) G £ and 
p GV according to (9). 

For notational simplicity, we write the clauses for {xij^pfl)} 


Note that, when j ^ T, 

{jf'pp '■ m = 1, - ■ ■ ,L,k Glj,j gT^ =0 and we ignore it 

in in ( 16 ). 

Based on the clause partition, a feasible \.Xij,p{l)} G .A^(P) 
to Subproblem 2 can be found distributively using the proba¬ 
bilistic distributed iterative CFL algorithm [11, Algorithm 1]. 
Specifically, for all (i,j) G £, p G V and I = I,-- - ,L, 
in each iteration, each node i realizes a Bernoulli random 
variable selecting Xipp(l)', messages on {xij^p{l)} are passed 
between adjacent nodes for each node i to evaluate its related 
clauses in (16); based on the evaluation, each node i updates 
the distribution of the Bernoulli random variable selecting 
Xipp(l). Given a feasible {xij^p{l)} G M.{L) obtained by CFL, 
Subproblem 1 is convex and can be solved distributively using 
standard convex decomposition. We omit the details here due 
to the page limitation. 

Now, we can develop a distributed algorithm to solve Prob¬ 
lem 2 based on CFL and convex decomposition, as briefly 
illustrated in Algorithm 1.^ 

'*In Step 3, CFL is run for a sufficiently long time. Step 4 (Step 6) 
can be implemented with a master node obtaining the network convergence 
information of CFL (network cost) from all nodes or with all nodes computing 
the average convergence indicator of CFL (average network cost) locally via a 
gossip algorithm. 


Algorithm 1 Algorithm for Problem 2 

1 : initialize n = 1 and Ui = +oo. 

2 : loop 

3: Run CFL to the CSP corresponding to Subproblem 2. 

4: if the CFL finds a feasible solution then 

5: For the obtained feasible solution to Subproblem 2, 

solve Subproblem 1 distributively using convex de¬ 
composition. Let Un denote the corresponding net¬ 
work cost. 

6: if Un < Un then 

7: set Un +1 = Un and n = n + 1. 

8 : end if 

9: end if 

10 : end loop 


Based on the convergence result of CFL [11, Corollary 2], 
we can easily see that Un U* {L) almost surely as n —>■ cxd, 
if Problem 2 is feasible. 


V. Alternative Formulation with Continuous 
Mixing 

The complexity of solving Problem 2 mainly lies in solving 
for the network mixing coefficients (discrete variables) in 
Subproblem 2. In this section, we first propose an equivalent 
alternative formulation of Problem 1 (Problem 2) with continu¬ 
ous mixing. Then, we elaborate on some distributed algorithms 
to solve it. 


A. Alternative Formulation 


Problem 1 (Problem 2) is a mixed discrete-continuous opti¬ 
mization problem. Applying continuous relaxation to (2) and 
(3) and manipulating (10), we obtain the following continuous 
optimization problem. 

Problem 3 (Continuous Formulation of Problem 1): 


K{L) 


min 




(Li)e£ 


s.t. (1), (4), (5), ( 6 ), (7), (9), (11) 


Xij,p{l) e [0,1], / = 1,• • • ,L, (j,j) e pcV (17) 

Pkijifm) e [0,1], l,m = I,--- ,L, c£ 

(18) 

fij,pi0 — Xij,p{l)Bij, Z = 1 , • • • , L, {i,j) C £, 

pevt, ter (19) 

Xij ^pipUj ^ Pkij xn^Xki^pi^lf k e Tii, I — 1, * * • ,7/ 

m = ,L, {i,j) eS^peV (20) 

Xij^plpn) < E Pkij if XffjXki^piX)-i 

,L 

m = lifU), {i,j) e£,peV (21) 
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Note that Constraints (17) and (18) in Problem 3 can be 
treated as the continuous relaxation of Constraints (2) and 
(3) in Problem 1. Constraint (19) in Problem 3 corresponds 
to Constraint ( 8 ) in Problem 1. Constraints (20) and (21) in 
Problem 3 can be treated as the continuous counterpart of 
Constraint (10) in Problem 1. The following lemma shows 
the relationship between Problem 1 (mixed discrete-continuous 
optimization problem) and Problem 3 (continuous optimization 
problem). 

Lemma 3 (Relationship between Problem 1 and Problem 3): 
(i) If {zij},{z,j{l)},{f*^p{l)},{x,j,p{l)},{/3kij{l,rn)} 
is a feasible solution to Problem 1, then 

{zij}, {Zij(^l)}, p(0}: m)} 

is a feasible solution to Problem 3, where 

Und (^l^ TTl'j — (I^ TJl^, 

if {%}> {Pkijil, m)} 

is a feasible solution to Problem 3, then 

{zij}, {%(()}, {/ 4 p( 0 }: {xij,pil)}^ m)}js a feasible 

solution to Problem 1, where Xij^p{l) = \xij,p{l)^ and 
I3kij{l,'m) = \Pkij{f'm)~\. (ii) The feasibilities of Problem 1 
and Problem 3 imply each other, (iii) The optimal values of 
Problem 1 and Problem 3 are the same, i.e., U*{L) = U*{L). 

Proof: Please refer to Appendix B. ■ 

By Lemma 3, solving Problem 1 is equivalent to solving 
Problem 3. 

B. Distributed Solution 

Problem 3 is a (pure) continuous optimization problem. It is 
not convex due to Constraints (20) and (21). Several penalty 
methods [ 12 ] can be applied to find a local minimum of 
Problem 3 with polynomial-time complexity. Those methods 
can also be implemented in a distributed manner using standard 
decomposition. On the other hand, by weak duality [12], dual 
method can be applied to find a lower bound of the global 
minimum value of Problem 3. The difference between an 
obtained local minimum and this lower bound can serve as 
an upper bound on the performance gap between the local 
minimum and the global minimum of Problem 3. 

VI. Conclusion 

In this paper, we considered linear network code construc¬ 
tions for general connections of continuous flows to minimize 
the total cost of edge use based on mixing. To solve the 
minimum-cost network coding design problem, we proposed 
two equivalent alternative formulations with discrete mixing 
and continuous mixing, respectively, and developed distributed 
algorithms to solve them. Our approach allows fairly general 
coding across flows and guarantees no greater cost than any 
solution without inter-flow network coding. 

Appendix A; Prooe oe Lemma 1 

First, we consider L = 1. We omit the index terms (1) 
and (1,1) behind the variables for notational simplicity. Let 
{zij}, {xij^p}, {(3kij} and {fij^p} denote a feasible solution 







to Problem 1. We shall extend the proof of Lemma 1 in [8] 
for the integer flows {fijp G {0,1}) and unit source rates 
{Rp = 1) with one global coding vector over each edge 
{Zij G jo, 1}) to the general continuous flows iflj^p G [0, Bij]) 
and source rates (Rp G R+) with multiple global coding 
vectors (z^j G [0, Bij]) over each edge. In the general case, 
we code over time n > 1. For all p G V, convert source 
p with source rate Rp over time n to \nRp\ unit rate sub¬ 
sources pi,-- - ,PinRp\ - For each edge (i,j) G E, allow the 
total number of the sub-flows of flow p G 'P* to terminal 
f G T to be fewer than or equal to Therefore, the 

flow path of flow p can be decomposed into \nRp\ unit rate 
sub-flow paths pi, • • • ,PinRp\ from source p G P* to terminal 
t G T. The sum rate of unit rate sub-flows of flow p over 
edge {i,j) G S is less than or equal to [ri/Fp]. The sum rate 
of unit rate sub-flows of all the flows over edge (t,j) is less 
than or equal to Zij = rnax^gT-[n/f . Decompose 
edge {i,j) into sub-edges. Let sub-flows to terminal t pass 
different sub-edges, i.e., each sub-edge transmit at most one 
sub-flow to terminal t. We have now reduced the general case 
to the special case considered in Lemma 1 in [8]. Therefore, we 
can show that there exists a feasible linear network code over 
time n. The associated average sum transmission rate over edge 
{i,j) is Zijin. Note that Zij jn — Zij jn < Pjn. Therefore, this 
code design can achieve the minimum cost U*{1) by taking n 
arbitrarily large. 

When L > 1, we can convert each edge (*,j) G £ into L 
edges. Then, we can apply the above proof for L = 1 to the 
equivalent constructed network. 

Appendix B: Proof of Lemma 3 

It is obvious that (i) implies (ii). Next, we show that (i) 
implies (iii). Suppose (i) holds, which indicates that each 
{zij} associated with a feasible solution to Problem 1 is also 
associated with a feasible solution to Problem 3, and vice versa. 
In addition, jzy } fully determines j)^E (%)■ Thus, the 
set of feasible network costs to Problem 1 is the same as that 
to Problem 3, implying the optimal values of the two problems 
are the same. Therefore, we can show that (i) implies (iii). 
Thus, to show Lemma 3, it is sufficient to show (i). Note that 
in the proof, we only need to consider the different constrains 
between Problem 1 and Problem 3. 

To show (i), we first show that when e {0,1} 

and I3kij{l,m) G {0,1}, Constraint (10) is equivalent to the 
following two constraints in (22) and (23). 

^ [^kij (^, Rkl)xki^p(l) t k G Ti, I — 1, • * • J L 
m = 1, • • ■ ,L, Ii ^ 0, (ij) G £,p G V (22) 

Xij^p(m) < E i^kij (^, (0; 

kGXi 1,... ,L 

m = !,■■■ ,L, Ii ^ 0, (iJ) G £,p G V (23) 

Note that Constraints (10), (22) and (23) are for all m = 
I,-- - ,L, Xi ^ 0, (i,j) G £ and p G V. Thus, we prove 


this equivalence by considering the following two cases for any 
m = 1, ■ ■ ■ , L, Ii ^ 0, {i,j) G £ and p gV. First, consider 
the case where Pkij{l,m)xki,p{l) = 0 for all k G Ii and 
I = 1, • • • ,L. Constraint (10) implies that Xij^p(m) = 0, and 
Constraints (22) and (23) also imply that Xij^p{m) = 0. Second, 
consider the case where there exists at least one pair {k,l), 
where k Gli and Z = 1, • • • , L, such that Pkijih 'n^)xki,pil) = 
1. Constraint (10) implies that Xij^p{m) = 1, and Constraints 
(22) and (23) also imply that Xij^p{m) = 1. Note that under the 
integer constraints Xij^p{l) G {0,1} and /?fey (Z,m) G {0,1}, 
the above two cases are the only two possible cases. Therefore, 
we can show Constraint (10) is equivalent to Constraints (22) 
and (23). 

Next, we show that the first statement of (i) holds. Suppose 
{zij}, {zij{l)}, {fkp{l)}^ {xij pjl)}^ {/3,^^j{l^m)} is a feasible 
solution to Problem 1. Let Xij^p{l) = Xij^p{l) G {0,1} 
for all Z = I,-- - ,L, (*,j) G £ and p G V, and 
Pki]il,rn) = /3kijil,m) G {0,1} for all k G Xi, Xi 
0, (lj) G ^ l,m = I,-- - ,L. Since Constraints 
(17), (18) and (19) in Problem 3 can be treated as the 
continuous relaxation of Constraints (2), (3) and (8) in 
Problem 1, {f!j^p{l)}, {xij,p{l)}, {Pkij{l,m)} satisfles Con¬ 
straints (17), (18) and (19). In addition, since Constraint 
(10) is equivalent to Constraints (22) and (23), and Con¬ 
straints (20) and (21) can be treated as the continuous re¬ 
laxation of Constraints (22) and (23), {xij^p(l)}, {Pkij{l',rn)} 
satisfles Constraints (20) and (21). Therefore, we can show 
{zij}, {zij{l)}, {fkp{l)}^ {xij^p(l)}, {Pk^jil,m)} is a feasible 
solution to Problem 3. 

Finally, we show that the second statement of (i) holds. 
Suppose {zij}, {Zij(l)}, {fkp(l)}, {xij,p(l)}, {Pki]il,rn)} is a 
feasible solution to Problem 3. Let Xij^p{l) = \xij,p{l)'] for 
all / = I,-- - ,L, {i,j) G £ and p G V, and f3kij{ 1 , 1 x 1 ) = 
\Pkij{l,m)] for all k G Xi, Xi ^ 0, {i,j)_ G £ and 
l,m = 1, - ■ ■ ,L. In other words, if Xij,p{l) = 0 (f3kij{l, m) = 
0), then Xij,p{l) = 0 (l3kij{l,m) = 0); if Xij,p{l,m) G 
(0,1] ((3kij{l,m) G (0,1]), then Xij,p{l) = 1 (I3kij{l,m) = 
1). It is obvious that {flj^p{l)},{xij,p{l)},{Pki]{l,m)} sat¬ 
isfles Constraints (2), (3) and (8). It remains to show 
{xij,p{l)},{l3kij{l,xn)} satisfles Constraint (10). Note that 
Constraint (10) is for all m = I,-- - ,L, Xi 7^ 0, {i,j) G £ 
and p G V. Thus, similarly, we prove this result by consid¬ 
ering the following two cases for any m = I,-- - ,L, Xi ^ 
0, {i,j) G £ and p G V. First, consider the case where 
Pkij{l,m)xki,p{l) = 0 for all k G Xi and / = I,-- - ,L. 
Constraints (20) and (21) imply that Xij^p{m) = 0, and 
hence, we have Xij^p{m) = \xij^p{m)'] = 0. In addition, 
l3kij{l,ixi)xki,p{l) = 0 for all k G Xi and I = I,-- - ,L 
also implies Pkij {I, iri)xki,pil) = \l3kijil,m)]\xki,pil)] = 

0 for all k G Xi and I = I,-- - ,L. Thus, in this case, 
we can show {xij^p{l)}, {Pkijiijixi)} satisfles Constraint (10). 
Second, consider the case where there exists at least one 
pair {k,l), where k G Xi and I = I,-- - ,L, such that 


9 


Pkij{l,m)xki,p{l) € (0,1]. Constraints (20) and (21) together 
with Constraints (17) and (18) imply that Xij^p(rn) € (0,1], 
and hence, we have Xij^p{m) = \xki,p{l)~\ = 1- In addition, 
Pkij{l,'m)xki,p{l) S (0,1] together with Constraints (17) and 
(18) also imply ^kzj{l,m)xki,p{l) = \Pkij{l,m)']\xki,p{l)] = 
1. Thus, in this case, we can show {xij^p{l)}, {^kijih'iT^)} 
satisfies Constraint (10). Note that under the continuous con¬ 
straints Xij^p{l) G [0,1] and jSkijil^rn) € [0,1], the above two 
cases are the only two possible cases. Therefore, we can show 
{zij},{zij{l)},{flj^p(l)},{xij^p(l)},{Pkij(l,rn)} is a feasible 
solution to Problem 1. 

Therefore, we complete the proof of Lemma 3. 
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